Reducing Ownership Overhead for Load-Store Sequences in Cache-Coherent Multiprocessors

نویسندگان

  • Jim Nilsson
  • Fredrik Dahlgren
چکیده

Parallel programs that modify shared data in a cachecoherent multiprocessor with a write-invalidate coherence protocol create ownership overhead in the form of ownership acquisitions at writes to shared data. This can have a significant impact on performance in a cache-coherent non-uniform memory architecture (NUMA) multiprocessor. By combining a read-request and an ownership acquisition, the write latency and network traffic can potentially be reduced. In this paper we propose a new hardware-based approach for performing this optimization by targeting load-store sequences, which we show is a super-set of migratory sharing. A loadstore sequence consists of a global read request followed by a global write action to the same memory location from the same processor, without any interveaning access to the same block from any other processor. We use detailed simulation with four benchmark programs including one on-line transaction processing (OLTP) workload and operating system execution to examine the effectiveness of the proposed technique. The results show that the technique is able to reduce write-related latency and network traffic more than previous hardware-based techniques, up to twice as much.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scheduling to Reduce Memory Coherence Overhead on Coarse-grain Multiprocessors 1 Scheduling to Reduce Memory Coherence Overhead on Coarse-grain Multiprocessors

Some Distributed Shared Memory (DSM) and Cache-Only Memory Architecture (COMA) multiprocessors keep processes near the data they reference by transparently replicating remote data in the processes' local memories. This automatic replication of data can impose substantial memory system overhead on an application since all replicated data must be kept coherent. We examine the eeect of task schedu...

متن کامل

Improving Performance of Load-Store Sequences for Transaction Processing Workloads on Multiprocessors

On-line transaction processing exhibits poor memory behavior in high-end multiprocessor servers because of complex sharing patterns and substantial interaction between the database server and the operating system. One contributing source is a large amount of load-store sequences in the program, resulting in many read misses as well as much global invalidation traffic. In this paper, we characte...

متن کامل

Memory Block Relocation in Cache-Only Memory Multiprocessors

COMA machine is similar to that in a traditional shared memory machine, there are a few aspects that differentiate the AMs from the cache memory in traditional cache-coherent multiprocessors [8]. One important aspect unique to COMA is that the backing store of the AMs in a COMA machine is disks of secondary storage. So, unlike a traditional multiprocessor cache, write-back to the backing store ...

متن کامل

Two proposals for the inclusion of directory information in the last-level private caches of glueless shared-memory multiprocessors

In glueless shared-memory multiprocessors where cache coherence is usually maintained using a directory-based protocol, the fast access to the on-chip components (caches and network router, among others) contrasts with the much slower main memory. Unfortunately, directory-based protocols need to obtain the sharing status of every memory block before coherence actions can be performed. This info...

متن کامل

Reducing Coherence Overhead in Shared-Bus Multiprocessors

To reduce the overhead of cache coherence enforcement in shared-bus multiprocessors, we propose a self-invalidation technique as an extension to write-invalidate protocols. The technique speculatively identi es cache blocks to be invalidated and dynamically determines when to invalidate them locally. We also consider enhancing our selfinvalidation scheme by incorporating read snar ng, to reduce...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000